Rec'dPcrynt i«dec 2004 

(12) INTERNATIONAL ApPBlCATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 

(19) World Intellectual Property 
Organization 

International Bureau 




(43) International Publication Date 
24 December 2003 (24.12.2003) PCT 



(10) International Publication Number 

WO 2003/106672 A3 



(51) International Patent Classification 7 : 
(21) International Application Number 



C12N 15/00 

PCT/JP2003/007514 



(22) International Filing Date: 

(25) Filing Language: 

(26) Publication Language: 



12 June 2003 (12.06.2003) 
English 
English 



(30) Priority Data: 

2002-171851 
2002-235294 



12 June 2002 (12.06.2002) JP 
12 August 2002 (12.08.2002) JP 



(71) Applicants (for all designated States except US): RIKEN 
[JP/JP]; 2-1, Hirosawa, Wako-shi, Saitama 351-0198 (JP). 
KABUSHIKI KAISHA DNAFORM [JP/JP]; 3-35, Mita 
1-chome, Minato-ku, Tokyo 108-00^3 (JP). 




(72) Inventors; and 

(75) Inventors/A&plicants (for US only): ^^YftSHIZAKI, 
Yoshihide [JP/JP]; 22-8, Inarimae, TsuRuba-§hi, Ibaraki 
305-0061 (JPAjCARNINCI, Piero [IT/JP]; cArIKEN, 
2-1, Hirosawa, Wako-shi, Saitama 351-0198 (JlfyllAR- 
BERS, Matthias, T. [DE/JP]; c/o KABUSHIKI KAISHA 



DNAFORM, 3-35, Mita 1-chome, Minato-ku, Tokyo 
108-0073 (JP). 

(74) Agents: OKUYAMA, Shoichi et ah; 8th Floor, Akasaka 
NOA Bldg., 2-12, Akasaka 3-chome, Minato-ku, Tokyo 
107-0052 (JP). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NI, NO, NZ, OM, PH, PL, PT, RO, RU, SC, SD, 
SE, SG, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, 
UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, 
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 



Published: 

— with international search report 



[ Continued on next page J 



(54) Title: METHOD OF UTILIZING THE 5*END OF TRANSCRIBED NUCLEIC ACID REGIONS FOR CLONING AND 
ANALYSIS 



1 st Strand cDNA Pool 



so 



o 



1 st Linker Ligation 
With Xma J! and Mme I Sites 

i 

2 nd Strand Synthesis 
Cleavage with Mme I 

2 nd Linker Ligation 
With Xba I Site 

i ■ ' 



Binding to Support 



PCR Amplification 
Cleavage with Xma Jl and Xba I 

i 

Purification of 33 bp Tags 

i 

Formation of Concatemers 



Cloning of Concatemers into Sequencing Vector 



Library of 5'-Sequence Tags 



(57) Abstract: A method is disclosed for obtaining the 5 'ends of transcribed regions from a plurality of nucleic acid fragments 
obtained from biological materials or synthetic pools. DNA fragments encoding the 5'ends are enriched for their individual analysis 
or for the analysis of concatemers thereof. The sequence information derived from 5' ends can be used for characterization and 
cloning of the transcriptome. 



WO%3/106672 A3 ■■IIIIMIIIIlim 



(88) Date of publication of the international search report: 

1 July 2004 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



WO 03/106672 

SEQUENCE LISTING 
<110> KABUSHIKI KAISHA DNAFORM 

<120> Method for utilizing the 5' end of mRNA for cloning and analysis 
<130> 1336 (PCT) 

<150> JP 2002-171851 
<151> 2002-06-12 

<150> JP 2002-235294 
<151> 2002-08-12 

<160> 77 

<170> Patentln version 3.1 

<210> 1 

<211> 74 

<212> DNA 

<213> Artificial 

<220> 

<223> First strand cDNA primer 
<220> 

<221> misc_feature 

<222> (73).. (73) 

<223> V is A, C or G 

<220> 

<221> misc_feature 

<222> (74).. (74) 

<223> n is any nucleotide 

<400> 1 aaggatcctg ccatttcatt acctctttct ccgcacccga catagatttt 60 



18/517544 

TPCT/JP03/07514 



74 

tttttttttt ttvn 

<210> 2 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 

<223> Upper oligonucleotide GN5 
<220> 

<221> misc_f eature . 

<222> (56) : .(60) 

<223> n is any nucleotide 

Sgalac ctcgagtaac tataacggtc ctaaggtagc gacctaggtc cgacgnnnnn 60 

<210> 3 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 

<223> Upper oligonucleotide N6 
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<220> 

<221> misc_feature 

<222> (55).. (60) 

<223> n is any nucleotide 

<400> 3 

agagagagac ctcgagtaac tataacggtc ctaaggtagc gacctaggtc cgacnnnnnn 

<210> 4 

<211> 54 

<212> DNA 

<213> Artificial 

<220> 

<223> Lower oligonucleotide 
<400> 4 

gtcggaccta ggtcgctacc ttaggaccgt tatagttact cgaggtctct ctct 

<210> 5 

<211> 55 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 5 , , 

agagagagac ctcgagtaac tataacggtc ctaaggtagc gacctaggtc cgacg 

<210> 6 

<211> 45 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 
<400> 6 

tctagatcag gactcttcta tagtgtcacc taaagtctct ctctc 

<210> 7 

<211> 47 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 



<220> 

<221> misc_feature 

<222> (46).. (47) 

<223> n" is any nucleotide 



<400> 7 

gagagagaga ctttaggtga cactatagaa gagtcctgat ctagann 



<210> 8 

<211> 25 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer 1 (uni-PCR) 

<400> 8 9 c 
gagagagaga ctttaggtga cacta ^° 

<210> 9 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 2(MneI-PCR) 
<400> 9 

agagagagac ctcgagtaac tataa '25. 

<210> 10 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> first strand oligo-dT primer 
<220> 

<221> misc_feature 

<222> (43).. (43) 

<223> V is A, C or G 

<220> 

<221> misc_feature 

<222> (44).. (44) 

<223> "n* is any nucleotide 

gagagagaga ggatccttct ggagagtttt tttttttttt ttvn 44 

<210> 11 

<211> 45 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide Bg-Gsu-GN5 
<220> 

<221> miscfeature 

<222> (41).. (45) 

<223> n* is any nucleotide 

<4 00> 11 L ,c 

agagagagaa ctaggcttaa taggtgacta gatctggagg nnnnn «o 

<210> 12 

<211> 45 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide Bg-Gsu-N6 
<220> 
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<221> miscjfeature 

<222> (40), -(45) % t#i 

<223> n* is any nucleotide 

agagagagaa ctaggcttaa taggtgacta gatctggagn nnnnn 45 

<210> 13 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide Bg-Gsu-down 

<400> 13 f x , ^ on 

ctggagatct agtcacctat taagcctagt tctctctct ™ 

<210> 14 

<211> 47 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide Bg-Mme-GN5 
<220> 

<221> misc_feature 

<222> J43) s . (47) 

<223> n is any nucleotide 

<220> 

<221> raisc_feature 
<222> (39). -(39) 
<223> V is G or A 

<400> 14 i j A7 

agagagagaa ctaggcttaa taggtgacta gatcttccra cgnnnnn 

<210> 15 

<211> 47 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide Bg-Mine-N6 
<220> 

<221> miscj:eature 

<222> (42) ..(47) 

<223> n is any nucleotide 

<220> 

<221> misc_feature 
<222> (39). .(39) 
<223> *r" is G or A 

<400> 15 A7 
agagagagaa ctaggcttaa taggtgacta gatcttccra cnnnnnn 

<210> 16 
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<211> 40 
<212> DNA 
<213> Artificial 

<220> 

<223> Oligonucleotide Bg-Mme-down 
<220> 

<221> misc_feature 

<222> (3).. (3) 

<223> *y* is C or T 

<400> 16 tjL . , , s , n 

gtyggagatc tagtcaccta ttaagcctag ttctctctct w 

<210> 17 

<211> 47 

<212> DNA 

<213> Artificial 

<220> 

<223> oligonucleotide 
<220> 

<221> misc_feature 

<222> (46).. (47) 

<223> V is any nucleotide 

gagagagaga ctttaggtga cactatagaa gagtcctgag aattcnn 47 

<210> 18 

<211> 45 

<212> DNA 

<213> Artificial 

<220> 

<223> oligonucleotide 

gaattctcag gactcttcta tagtgtcacc taaagtctct ctctc 45 

<210> 19 

<211> 49 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 



<220> 

<221> misc_feature 

<222> (45). .(49) 

<223> n is any nucleotide 



<400> 19 

agagagagag cttagatgag agtgactcga gcctaggtcc aacgnnnnn 49 

<210> 20 

<211> 49 

<212> DNA 

<213> Artificial 
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<220> 

<223> linker 
<220> 

<221> misc_feature 

<222> (44) ..(49) 

<223> n is any nucleotide 

agagagagag cttagatgag agtgactcga gcctaggtcc aacnnnnnn 49 

<210> 21 

<211> 43 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 

gttggaccta ggctcgagtc actctcatct aagctctctc tct 43 

<210> 22 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> second linker 

<400> 22 , ao 

gaattctacg cctctcgatc gaaatcccga tctaggctag eg 

<210> 23 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> second linker 

<400> 23 , A9 

ettaagatge ggagagcgtg aatcgagttt aaggctagca tc 

<210> 24 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 24 9 k 
ttagatgaga gtgactcgag cctag *° 

<210> 25 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 
<400> 25 
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ctacgatcgg aatttgagct aagtg 
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25 



<210> 26 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 26 

caggaaacag ctatgac 



<210> 27 

<2U> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> primer 

<400> 27 
gtaaaacgac ggccag 



<210> 28 

<211> 596 

<212> DNA 

<213> Homo sapiens 



17 



<220> 

<221> misc feature 

<222> (432|..(432) 
<223> n* is any nucleotide 

<220> 

<221> misc feature 

<222> (4367- -(436) 

<223> n" is any nucleotide 

<220> 

<221> misc feature 
<222> (4567.. (456) i#j 
<223> n* is any nucleotide 

tcgttaacta ttaggcgaat tgggccctct aggtcgacga gttctcagca gagccgccgt 
ctagagcccc gccctcccgg gccaccgtcg gacctagaat agttactcga ggtctctcgt 
cggacctaga gtttttcgta tgtttgtcat cgtcggacct aggtccgacg gtccattcct 
gagagtctct ctaggtccga cgagagagag aggatccttc tgtctagacc ctgacgccgg 
aaccgcaccg tcggacctag gtccgacgga aaagcagctt cctccactct aggtccgacg 
gtgtgtgtgt gtgtgcgtgt tctagagact ggttcagatc aaaagtcgtc ggacctaggt 
ccgacggggc tggtgagatg gctcagtcta gatgcatgct cgagcggccg ccagtgtgat 
ggatatctgc cnaatnccag cacaccggcg cgcgcnacca gtggatccga gcccggtacc 480 
aagcttgatg catacctcga gtatcctata ctgtcaccta aatagcttgg ggtaatcatg 540 
gtcatagctg tctcctgtgt gaaattgtta tccgctcaaa attcccaaca acatag 596 



60 
120 
180 
240 
300 
360 
420 
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<210> 29 

<211> 12 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 
<220> 

<221> miscjfeature 

<222> (1)..(1) , tj 

<223> *n is any nucleotide 

<400> 29 1? 

nctaggtccg ac l£l 

<210> 30 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 
<220> 

<221> misc_feature 

<222> (1}. : (1) , ... 

<223> n is any nucleotide 

<220> 

<221> miscjfeature 

<222> (20).. (20) 

<223> V is any nucleotide 

<400> 30 9n 
ngttggacct aggtccaacn cu 

<210> 31 

<211> 13 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 



<400> 31 
tctaggtccg acg 



<210> 32 

<211> 13 

<212> DNA 

<213> Artificial 

<220> 

<223> linker 



13 



<400> 32 
cctaggtccg acg 

<210> 33 
<211> 20 
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<212> DNA 
<213> Artificial 

<220> 

<223> tagl 

<400> 33 on 

gtggcccggg agggcggggc * u 

<210> 34 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag2 
<400> 34 

agagacctcg agtaactat 13 

<210> 35 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag3 
<400> 35 

atgacaaaca tacgaaaaac ' *v 

<210> 36 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag4 
<400> 36 

gtccattcct gagagtctc 1S 

<210> 37 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag5 

<400> 37 9ft 
agagagagag gatccttctg M 

<210> 38 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag6 

<400> 38 „ 
gtgcggttcc ggcgtcaggg 6V 
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<210> 39 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag7 

<400> 39 

gaaaagcagc ttcctccac 



<210> 40 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag8 

<400> 40 

gtgtgtgtgt gtgtgcgtgt 



<210> 41 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag9 

<400> 41 

acttttgatc tgaaccagtc 



<210> 42 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> taglO 

<400> 42 

gggctggtga gatggctcag 



<210> 43 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl 

<400> 43 

gtacctcctc gcatcccgc 



<210> 44 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag2 

<400> 44 

gtggtgtgcg tgtcgaaggt 
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19 



20 



20 



20 



19 
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<210> 45 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag3 

<400> 45 

atggaccgag ggccccagcc 



<210> 46 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag4 

<400> 46 

cggatcgggt gggtcggac 



<210> 47 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag5 

<400> 47 

agaggtcgca gcagttcgt 



<210> 48 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag6 

<400> 48 

tctccggagc cggcgctgtg 



<210> 49 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag7 

<400> 49 

agactttgca ggctccgag 



<210> 50 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag8 
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20 



19 



19 



20 
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<400> 50 

ggagctgccg cagcgccgga 



<210> 51 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag9 

<400> 51 

acaccgtcgg acctggtcgc 



<210> 52 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> taglO 

<400> 52 

agacgttctc gcccagagtc 



<210> 53 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tagll 

<400> 53 

gccgttcctt gcttgctgga 



<210> 54 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl2 

<400> 54 

gggttgggga tttagctcag 



<210> 55 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl3 

<400> 55 

gagtaactat aacggtcct 



<210> 56 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 
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20 
20 
20 
20 
20 
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<223> tagl4 
<400> 56 

gattccgcct ggagctcgc 



<210> 57 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl5 

<400> 57 

agggaccgct gcggtccg 



<210> 58 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> zzb21106i09t3 junk 

<400> 58 

cattagggga ttgggccc 



<210> 59 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> zzb21106i09t3 junk 

<400> 59 

acccgggggg cgggactaac cgtcggac 



<210> 60 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl 

<400> 60 

gacgcggaag gcgcggcggc 



<210> 61 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> tag2 

<400> 61 

ggagggcggg cggcggccct c 



<210> 62 

<211> 19 

<212> DNA 

<213> Artificial 
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19 



18 



18 



28 



20 
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<220> 

<223> tag3 
<400> 62 

caaaaaaaaa aaaaaaact 



<210> 63 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag4 

<400> 63 

aggctctgct cgctctgccc 



<210> 64 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag5 

<400> 64 

acttctgatt ctgacagac 



<210> 65 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag6 

<400> 65 

acagtggcgt ctgcaaagc 



<210> 66 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tag7 

<400> 66 

ggaaagtcca ggtggacttt 



<210> 67 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tag8 

<400> 67 

gccgccgagg ccgcgcagg 



<210> 68 
<211> 19 



PCT/JP03/07514 



19 



20 



19 



19 



20 
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<212> DM 
<213> Artificial 

<220> 

<223> tag9 

<400> 68 in • 

gttagtgtat aacagagtt 13 

<210> 69 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> taglO 

<400> 69 9ri 
tcgcccgctg ttcagtctct M 

<210> 70 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tagll 

<400> 70 1Q 
aggtggggca agatggctg ia 

<210> 71 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl2 
<400> 71 

ggcatggcca gaaggcaagc * u 

<210> 72 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl3 
<400> 72 

gacgcacgca tagagggggg 6U 

<210> 73 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> tagl4 
<220> 

<221> misc feature 



<222> (1)..(D , ... 
<223> n* is any nucleotide 
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<400> 73 

nccatggaac agccacact 19 

<210> 74 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> junkl 
<400> 74 

tgataaggca atggcctcta atgctg 26 

<210> 75 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> tag 
<400> 75 

acctccctcc gcggag 16 

<210> 76 

<211> 2745 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (842).. (2017) 
<223> 

<400> 76 

acctccctcc gcggagcagc cagacagcga gggccccggc cgggggcagg ggggacgccc 60 

cgtccggggc accccccccg gctctgagcc gcccgcgggg ccggcctcgg cccggagcgg 120 

aggaaggagt cgccgaggag cagcctgagg ccccagagtc tgagacgagc cgccgccgcc 180 

cccgccactg cggggaggag ggggaggagg agcgggagga gggacgagct ggtcgggaga 240 

agaggaaaaa aacttttgag acttttccgt tgccgctggg agccggaggc gcggggacct 300 

cttggcgcga cgctgccccg cgaggaggca ggacttgggg accccagacc gcctcccttt 360 

gccgccgggg acgcttgctc cctccctgcc ccctacacgg cgtccctcag gcgcccccat 420 

tccggaccag ccctcgggag tcgccgaccc ggcctcccgc aaagactttt ccccagacct 480 

cgggcgcacc ccctgcacgc cgccttcatc cccggcctgt ctcctgagcc cccgcgcatc 540 

ctagaccctt tctcctccag gagacggatc tctctccgac ctgccacaga tcccctattc 600 

aagaccaccc accttctggt accagatcgc gcccatctag gttatttccg tgggatactg 660 

agacaccccc ggtccaagcc tcccctccac cactgcgccc ttctccctga ggagcctcag 720 

ctttccctcg aggccctcct accttttgcc gggagacccc cagcccctgc aggggcgggg 780 

cctccccacc acaccagccc tgttcgcgct ctcggcagtg ccggggggcg ccgcctcccc 840 

c atg ccg ccc tec ggg ctg egg ctg ctg ccg ctg ctg eta ccg ctg ctg 889 
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Met Pro Pro Ser Gly Leu Arg Leu Leu Pro Leu Leu Leu Pro Leu Leu 
15 10 15 

tgg eta ctg gtg ctg acg cct ggc ccg ccg gec gcg gga eta tec acc 937 
Trp Leu Leu Val Leu Thr Pro Gly Pro Pro Ala Ala Gly Leu Ser Thr 
20 25 30 

tgc aag act ate gac atg gag ctg gtg aag egg aag cgc ate gag gee 
Cys Lys Thr He Asp Met Glu Leu Val Lys Arg Lys Arg He Glu Ala 
35 40 45 

ate cgc ggc cag ate ctg tec aag ctg egg etc gee age ccc ccg age 
He Arg Gly Gin He Leu Ser Lys Leu Arg Leu Ala Ser Pro Pro Ser 
50 55 60 

cag ggg gag gtg ccg ccc ggc ccg ctg ccc gag gec gtg etc gee etg 
Gin Gly Glu Val Pro Pro Gly Pro Leu Pro Glu Ala Val Leu Ala Leu 
65 70 75 80 

tac aac age acc cgc gac egg gtg gec ggg gag agt gca gaa ccg gag 1129 
Tyr Asn Ser Thr Arg Asp Arg Val Ala Gly Glu Ser Ala Glu Pro Glu 
85 90 95 

ccc gag cct gag gee gac tac tac gee aag gag gtc acc cgc gtg eta 1177 
Pro Glu Pro Glu Ala Asp Tyr Tyr Ala Lys Glu Val Thr Arg Val Leu 
100 105 110 

atg gtg gaa acc cac aac gaa ate tat gac aag ttc aag cag agt aca 1225 
Met Val Glu Thr His Asn Glu He Tyr Asp Lys Phe Lys Gin Ser Thr 
115 120 125 

cac age ata tat atg ttc ttc aac aca tea gag etc cga gaa gcg gta 1273 
His Ser He Tyr Met Phe Phe Asn Thr Ser Glu Leu Arg Glu Ala Val 
130 135 140 

cct gaa ccc gtg ttg etc tec egg gca gag ctg cgt ctg ctg agg agg 1321 
Pro Glu Pro Val Leu Leu Ser Arg Ala Glu Leu Arg Leu Leu Arg Arg 
145 150 155 160 

etc aag tta aaa gtg gag cag cac gtg gag ctg tac cag aaa tac age 
Leu Lys Leu Lys Val Glu Gin His Val Glu Leu Tyr Gin Lys Tyr Ser 
165 170 175 



1369 



aac aat tec tgg cga tac etc age aac egg ctg ctg gca ccc age gac 1417 
Asn Asn Ser Trp Arg Tyr Leu Ser Asn Arg Leu Leu Ala Pro Ser Asp 
180 185 190 

teg cca gag tgg tta tct ttt gat gtc acc gga gtt gtg egg cag tgg 1465 
Ser Pro Glu Trp Leu Ser Phe Asp Val Thr Gly Val Val Arg Gin Trp 
195 200 205 

ttg age cgt gga ggg gaa att gag ggc ttt cgc ctt age gec cac tgc 1513 
Leu Ser Arg Gly Gly Glu He Glu Gly Phe Arg Leu Ser Ala His Cys 
210 215 220 

tec tgt gac age agg gat aac aca ctg caa gtg gac ate aac ggg ttc 1561 
Ser Cys Asp Ser Arg Asp Asn Thr Leu Gin Val Asp He Asn Gly Phe 
225 230 235 240 

act acc ggc cgc cga ggt gac ctg gee acc att cat ggc atg aac egg 1609 
Thr Thr Gly Arg Arg Gly Asp Leu Ala Thr He His Gly Met Asn Arg 
245 250 255 

cct ttc ctg ctt etc atg gee acc ccg ctg gag agg gee cag cat ctg 1657 
Pro Phe Leu Leu Leu Met Ala Thr Pro Leu Glu Arg Ala Gin His Leu 
260 265 270 

caa age tec egg cac cgc cga gee ctg gac acc aac tat tgc ttc age 1705 
Gin Ser Ser Arg His Arg Arg Ala Leu Asp Thr Asn Tyr Cys Phe Ser 
275 280 285 
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tec acg gag aag aac tgc tgc gtg egg cag ctg tac att gac ttc cgc 1753 
Ser Thr Glu Lys Asn Cys Cys Val Arg Gin Leu Tyr He Asp Phe Arg 
290 295 300 

aag gac etc ggc tgg aag tgg ate cac gag ccc aag ggc tac cat gec 1801 
Lys Asp Leu Gly Trp Lys Trp He His Glu Pro Lys Gly Tyr His Ala 
305 310 315 320 

aac ttc tgc etc ggg ccc tgc ccc tac att tgg age ctg gac acg cag 1849 
Asn Phe Cys Leu Gly Pro Cys Pro Tyr He Trp Ser Leu Asp Thr Gin 
325 330 335 

tac age aag gtc ctg gec ctg tac aac cag cat aac ccg ggc gee teg 1897 
Tyr Ser Lys Val Leu Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser 
340 345 350 



gcg gcg ccg tgc tgc gtg ccg cag gcg ctg gag ccg ctg ccc ate gtg 
Ala Ala Pro Cys Cys Val Pro Gin Ala Leu Glu Pro Leu Pro lie Val 
355 360 365 



1945 



tac tac gtg ggc cgc aag ccc aag gtg gag cag ctg tec aac atg ate 1993 
Tyr Tyr Val Gly Arg Lys Pro Lys Val Glu Gin Leu Ser Asn Met lie 
370 375 380 

gtg cgc tec tgc aag tgc age tga ggtcccgccc cgccccgccc cgccccggca 2047 
Val Arg Ser Cys Lys Cys Ser 
385 390 

ggcccggccc caccccgccc cgcccccgct gccttgccca tgggggctgt atttaaggac 2107 

accgtgcccc aagcccacct ggggccccat taaagatgga gagaggactg eggatctctg 2167 

tgtcattggg cgcctgcctg gggtctccat ccctgacgtt cccccactcc cactccctct 2227 

ctctccctct ctgcctcctc ctgcctgtct gcactattcc tttgcccggc atcaaggcac 2287 

aggggaccag tggggaacac tactgtagtt agatctattt attgagcacc ttgggcactg 2347 

ttgaagtgcc ttacattaat gaactcattc agtcaccata gcaacactct gagatggcag 2407 

ggactctgat aacacccatt ttaaaggttg aggaaacaag cccagagagg ttaagggagg 2467 

agttcctgcc caccaggaac ctgctttagt gggggatagt gaagaagaca ataaaagata 2527 

gtagttcagg ecaggegggg tgctcacgcc tgtaatccta gcacttttgg gaggcagaga 2587 

tgggaggata cttgaatcca ggcatttgag accagcctgg gtaacatagt gagaccctat 2647 

ctctacaaaa cacttttaaa aaatgtacac ctgtggtccc agctactctg gaggctaagg 2707 

tgggaggatc acttgatcct gggaggtcaa ggctgcag 2745 

<210> 77 

<211> 391 

<212> PRT 

<213> Homo sapiens 

<400> 77 

Met Pro Pro Ser Gly Leu Arg Leu Leu Pro Leu Leu Leu Pro Leu Leu 
15 10 15 

Trp Leu Leu Val Leu Thr Pro Gly Pro Pro Ala Ala Gly Leu Ser Thr 
20 25 30 

Cys Lys Thr lie Asp Met Glu Leu Val Lys Arg Lys Arg He Glu Ala 
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35 40 45 

He Arg Gly Gin He Leu Ser Lys Leu Arg Leu Ala Ser Pro Pro Ser 
50 55 60 

Gin Gly Glu Val Pro Pro Gly Pro Leu Pro Glu Ala Val Leu Ala Leu 
65 70 75 80 

Tyr Asn Ser Thr Arg Asp Arg Val Ala Gly Glu Ser Ala Glu Pro Glu 
85 90 95 

Pro Glu Pro Glu Ala Asp Tyr Tyr Ala Lys Glu Val Thr Arg Val Leu 
100 105 110 

Met Val Glu Thr His Asn Glu He Tyr Asp Lys Phe Lys Gin Ser Thr 
115 120 125 

His Ser lie Tyr Met Phe Phe Asn Thr Ser Glu Leu Arg Glu Ala Val 
130 135 140 

Pro Glu Pro Val Leu Leu Ser Arg Ala Glu Leu Arg Leu Leu Arg Arg 
145 150 155 160 

Leu Lys Leu Lys Val Glu Gin His Val Glu Leu Tyr Gin Lys Tyr Ser 
165 170 175 

Asn Asn Ser Trp Arg Tyr Leu Ser Asn Arg Leu Leu Ala Pro Ser Asp 
180 185 190 

Ser Pro Glu Trp Leu Ser Phe Asp Val Thr Gly Val Val Arg Gin Trp 
195 200 205 

Leu Ser Arg Gly Gly Glu He Glu Gly Phe Arg Leu Ser Ala His Cys 
210 215 220 

Ser Cys Asp Ser Arg Asp Asn Thr Leu Gin Val Asp He Asn Gly Phe 
225 230 235 240 

Thr Thr Gly Arg Arg Gly Asp Leu Ala Thr He His Gly Met Asn Arg 
245 250 255 

Pro Phe Leu Leu Leu Met Ala Thr Pro Leu Glu Arg Ala Gin His Leu 
260 265 270 

Gin Ser Ser Arg His Arg Arg Ala Leu Asp Thr Asn Tyr Cys Phe Ser 
275 280 285 

Ser Thr Glu Lys Asn Cys Cys Val Arg Gin Leu Tyr He Asp Phe Arg 
290 295 300 

Lys Asp Leu Gly Trp Lys Trp He His Glu Pro Lys Gly Tyr His Ala 
305 310 315 320 
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Asn Phe Cys Leu Gly Pro Cys Pro Tyr lie Trp Ser Leu Asp Thr Gin 
325 330 335 

Tyr Ser Lys Val Leu Ala Leu Tyr Asn Gin His Asn Pro Gly Ala Ser 
340 345 350 

Ala Ala Pro Cys Cys Val Pro Gin Ala Leu Glu Pro Leu Pro He Val 
355 360 365 

Tyr Tyr Val Gly Arg Lys Pro Lys Val Glu Gin Leu Ser Asn Met He 
370 375 380 

Val Arg Ser Cys Lys Cys Ser 
385 390 
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